# AI Agent

Nelly
Nelly is a complete AI agent platform that allows building, using, and sharing AI agents without coding. It provides features such as building AI agents via natural language, conversing naturally with AI agents, and sharing and marketizing AI agents.
AI Agent
37.0K

Colossal
Colossal provides a global agent directory, allowing users to easily connect and integrate various AI agents capable of executing API calls, thereby simplifying the tool development process. It offers businesses an efficient way to manage and automate common workflows such as customer support, messaging, and order management. Through integration with several well-known platforms (such as Zendesk, Twilio, Slack, etc.), Colossal helps businesses save development time and costs while increasing operational efficiency. It aims to provide commercial users with a one-stop AI agent integration solution. Pricing is yet to be determined but is expected to be based on usage or company size.
API Services
58.2K

Manus
Manus is the world's first truly autonomous AI agent developed by Monica.im, capable of delivering complete task results directly, rather than just providing suggestions or answers. It uses a Multiple Agent architecture, runs in an independent virtual machine, and can complete tasks directly by writing and executing code, browsing the web, and operating applications. Manus achieved SOTA performance in the GAIA benchmark test, demonstrating its powerful task execution capabilities. Its goal is to be users' 'agent' in the digital world, helping users efficiently complete various complex tasks.
Personal Assistance
81.4K

Mahilo
Mahilo is a powerful AI agent integration platform designed to connect AI agents from various frameworks for real-time communication and human oversight. It provides a framework-agnostic communication protocol, supporting various popular agent frameworks like LangGraph, Pydantic AI, etc., while also allowing connection to proprietary agents via API. The platform emphasizes intelligent collaboration, organization-level policy management, and human-centered design, ensuring human control while automating tasks. Mahilo offers a flexible solution for building complex multi-agent systems, suitable for various applications from content creation to emergency response. Currently, Mahilo boasts 251 stars on GitHub and over 500 monthly PyPI downloads, showcasing its popularity within the developer community. Mahilo primarily targets developers and enterprise users, helping them quickly build and deploy multi-agent systems to improve efficiency and foster innovation.
Development & Tools
45.8K

Lemni
Lemni is an AI platform focused on enhancing customer experience, using custom AI agents to help businesses achieve efficient and personalized customer interactions. This product leverages advanced AI technology to quickly respond to customer needs, supports multilingual interaction, and seamlessly integrates with existing tools. Key advantages of Lemni include rapid deployment, high customizability, and powerful automation capabilities. Its goal is to help businesses expand globally while maintaining close contact with customers. Lemni's pricing strategy is flexible and suitable for businesses of all sizes.
Customer service
58.2K
English Picks

Cloudflare AI Agents
Cloudflare AI Agents is a platform built on Cloudflare Workers and Workers AI, designed to help developers build AI agents capable of autonomously executing tasks. The platform provides the `agents-sdk` and other tools enabling developers to quickly create, deploy, and manage AI agents. Key advantages include low latency, high scalability, cost-effectiveness, and support for complex task automation and dynamic decision-making. Cloudflare's globally distributed network and Durable Objects technology provide robust foundational support for AI agents.
Development & Tools
65.7K

Wren AI
Wren AI is an open-source SQL AI agent designed to help data and product teams interact with data through natural language, generating SQL queries, charts, spreadsheets, reports, and BI. It employs a semantic engine architecture to provide business context for large language models (LLMs) and uses Modeling Definition Language to handle metadata, architecture, terminology, and the logic behind calculations and aggregations, generating accurate SQL queries with semantic context. Key benefits of Wren AI include ease of use, security and reliability, open-source access, support for various data sources and analytical tools like BigQuery, DuckDB, and PostgreSQL, along with integration capabilities for popular tools like Excel and Google Sheets. It also supports various LLM models, regardless of whether they are hosted in the cloud or on-premises. Wren AI is positioned as a powerful tool for data teams to enhance data access and analytical efficiency.
Data Analysis
62.4K

Brushedit
BrushEdit is an advanced, unified AI agent for image restoration and editing. It combines multimodal large language models (MLLMs) and image restoration models to achieve automated, user-friendly, and interactive free-form instruction editing. The system employs MLLMs and a dual-branch image restoration framework to perform editing category classification, primary object recognition, mask generation, and area restoration. Extensive experiments show that this framework effectively integrates MLLMs and restoration models, achieving superior performance across seven key metrics, including mask area retention and editing effect consistency.
AI design tools
49.1K
English Picks

Elevenlabs Conversational AI
ElevenLabs Conversational AI is a voice agent product that can be rapidly deployed on websites, mobile devices, or phones. It features low latency, full configurability, and seamless scalability, supporting turn-taking and interruption handling in natural conversations, making it suitable for unpredictable dialogues in noisy environments. The product combines speech-to-text, large language models (LLM), and text-to-speech technologies, supporting multiple languages and customizable voices for various scenarios including customer support, scheduling, and outbound sales.
Chatbot
61.3K

Userfeedchat
UserFeedChat is an AI user research tool that allows users to request features and report bugs to an AI agent through natural conversations, revealing genuine user insights. The tool provides users with key user pain points and frustrations through daily and weekly reports, helping businesses better understand user needs and optimize their products. UserFeedChat protects user data, ensuring that all conversation data is visible only to the business and complies with storage regulations. The product background indicates that UserFeedChat aims to reduce the time and hassle of conducting interviews by automating user research, while providing deeper user understanding.
User Research Tools
48.3K

Integuru
Integuru is an AI agent capable of generating integration code for third-party platforms using reverse engineering technology. By analyzing network requests and user actions in the browser, it automatically generates Python code that can trigger specific actions. This technology is crucial as it allows developers to quickly build integration solutions without needing an in-depth understanding of the internal APIs of third-party platforms, thereby increasing development efficiency and lowering technical barriers. Integuru is developed by Integuru.ai and is an open-source project that supports custom requests and additional features.
Development & Tools
327.6K
Fresh Picks

TEN Agent
TEN Agent is an innovative multimodal AI agent that integrates OpenAI's real-time API to provide users with a powerful interactive platform. This product represents the latest advancements in artificial intelligence for multimodal interactions, capable of understanding text while also processing image and audio data types. The key advantages of TEN Agent lie in its high level of integration and real-time capabilities, offering users quick and accurate feedback, significantly enhancing efficiency and user experience. The product background indicates that TEN Agent aims to advance productivity tools through cutting-edge AI technology and is currently in the beta testing phase. Regarding pricing and positioning, TEN Agent may offer a free trial to attract early users and gather feedback for further product optimization.
Personal Assistance
88.0K

Talkstack AI
Talkstack AI is a platform that utilizes artificial intelligence technology to provide customer support and sales agent services. With AI agents, it can perform complex tasks in multiple languages, supports text and voice communication, and offers enterprise-level security. Key advantages of the product include no need for pre-recorded audio or trigger words, fully AI-generated voice responses, and the capability to scale sales and operational teams. It also supports the creation of custom workflows and makes it easy to review the accuracy of responses generated by the AI agent.
AI customer service assistant
48.3K
Fresh Picks

Deepgram Voice Agent API
The Deepgram Voice Agent API is a unified voice-to-voice API that enables natural-sounding conversations between humans and machines. This API is backed by industry-leading speech recognition and synthesis models that allow for natural and real-time listening, thinking, and speaking. Deepgram is committed to advancing a voice-first AI future through its agent API, integrating cutting-edge generative AI technology to create business solutions with smooth, human-like speech agents.
AI speech recognition
61.8K

AIGC Tool Navigation
AIGC Tool Navigation is a platform dedicated to generative artificial intelligence, offering a diverse range of AI tools including AI writing, AI drawing, AI design, AI office, AI video, AI voice, AI music, AI thesis, AI resume, AI digital human, AI Agent, text-to-speech, and more. The platform covers popular AI tools such as Xiaohongshu copy generator, Toast AI, AIPPT, ChatPPT, etc., aiming to provide users with a one-stop search and usage experience for AI tools, enhancing work efficiency and creativity.
AI information platform
50.2K
English Picks

Agent Q
Agent Q is the next-generation AI agent model developed by MultiOn. By integrating search, self-criticism, and reinforcement learning, it creates advanced autonomous web agents capable of planning and self-repair. It addresses the challenges of traditional large language models (LLMs) in multi-step reasoning tasks within dynamic environments, enhancing success rates in complex scenarios using guided Monte Carlo Tree Search (MCTS), AI self-criticism, and Direct Preference Optimization (DPO) algorithms.
AI Agents
55.5K

Bardeenai
Bardeen AI is an AI agent that executes repetitive tasks with simple prompts, designed to streamline workflows and enhance efficiency. It integrates with various applications and browsers to complete tasks securely and reliably. The main advantages of Bardeen AI include no need for programming or technical background, operation through simple language commands, real-time confirmation of action plans, and continuous task execution in the background. It supports multiple integrations such as Google Sheets, Slack, LinkedIn, making it suitable for various scenarios like sales, recruitment, and market research.
Automated Workflow
47.7K

Amabay
Amabay is an AI-driven Q&A platform that enables users to create their own Amabot— a personalized AI agent to respond to queries. Leveraging RAG technology, it generates accurate and objective answers, providing users with a new way to present themselves and communicate. Amabay is suitable for individuals and organizations looking to enhance online interaction. Currently, Amabay offers free services, though specific pricing strategies and positioning remain undefined.
Chatbot
48.3K

Fluidworks
Fluidworks leverages AI agents to deliver real-time video demos, enhancing customer engagement, driving sales efficiency, optimizing sales team focus, and providing data-driven insights to refine sales strategies. Through personalized, real-time demonstrations and instant Q&A, it offers a customized experience for customers, allowing them easy access to demos, ensuring consistency and reliability of information, and ultimately helping them make informed purchasing decisions.
Sales
49.1K
Fresh Picks

Scoopika
Scoopika is an open-source developer platform designed to empower developers to build personalized AI agents that can see, speak, hear, learn, and take action. It provides a secure, efficient, and user-friendly platform for the AI era, supporting full edge compatibility and real-time streaming. Built-in visual and voice chat functionality enhances user interaction. Scoopika emphasizes its open-source nature, offering server-side and client-side runtimes, as well as integration modules for React projects, fostering a vibrant and growing developer community.
Development Platform
53.3K

Real Time Voice AI Agent
Real-time Voice AI Agent is a highly flexible real-time voice interaction model capable of answering any query via voice in approximately 500 milliseconds. The model supports the user's selection of any large language model, text-to-speech (TTS) model, and speech-to-text (STT) model. It is ideal for applications involving voice, such as customer service robots and receptionists.
AI voice assistant
70.1K
Fresh Picks

Agently Daily News Collector
Agently Daily News Collector is an open-source project built on the Agently AI application development framework, capable of automatically collecting news on specific topics. Users simply input the field of news they want collected, and the AI agent will automatically work until it generates and saves a high-quality collection of news in Markdown files.
AI News
82.2K

Finrobot
FinRobot is an open-source AI agent platform that leverages large language models (LLMs) to provide comprehensive solutions for financial applications. It integrates various AI technologies, going beyond simple language modeling to showcase its versatility and adaptability, catering to the diverse needs of the financial industry. The concept of AI agents in FinRobot refers to intelligent entities that use large language models as their 'brains' to perceive the environment, make decisions, and execute actions. Unlike traditional AI, AI agents possess the ability to think independently and utilize tools to progressively achieve given objectives.
AI Agents
104.6K

Octoverse
Octoverse is an AI agent model designed to help developers build AI companions within applications that can understand and complete tasks. It's four times faster and ten times less expensive for function calls compared to GPT-4, while achieving higher accuracy. Through advancements in model specialization, Octoverse provides a significant leap towards sustainable, accessible, and user-friendly AI applications, addressing issues of privacy, cost, and latency.
Development & Tools
45.5K

Alice
Alice is a lightweight AI agent designed to create a self-contained AI assistant similar to JARVIS. It achieves this by building a "text computer" centered around a large language model (LLM). Alice excels in tasks like topic research, coding, system administration, literature reviews, and complex mixed tasks that go beyond these basic capabilities. Alice has achieved near-perfect performance in everyday tasks using GPT-4 and is leveraging the latest open-source models for practical application.
AI Agents
458.4K

V IRL
V-IRL utilizes existing mapping technologies and street view image APIs to empower researchers to deploy AI agents in virtual replicas of locations worldwide. These agents are capable of performing diverse tasks such as navigation, location recognition, and service recommendation, all based on the data they 'see' and 'comprehend' in the virtual environment. Simply put, V-IRL allows AI to train and operate in a virtual, real-world data-based environment, with the aim of enhancing its ability to tackle real-world problems. Through testing and optimizing AI models in such an environment, V-IRL offers a practical, efficient, and low-cost platform for AI research and application.
AI Agents
54.1K

Pokéllmon
POKéLLMON is the first LLM-based agent to achieve human-level performance in a tactical combat game. It leverages three key strategies: 1) Contextual reinforcement learning, which uses text-based feedback from battles to iteratively optimize its generation strategy; 2) Knowledge-enhanced generation, which utilizes external knowledge to combat hallucinations and enable the agent to act accurately and timely; 3) Self-consistent action generation, which mitigates panicking switching behavior when facing strong opponents and avoiding combat. Battles against human players demonstrate POKéLLMON's human-level fighting ability and strategy, achieving a 49% win rate in tournament matches and a 56% win rate in invitational matches. Furthermore, it reveals vulnerabilities in human players' exhaustion strategies and deceptive tactics.
AI Game Creation
50.8K

Monoid
Monoid enables APIs to become actionable, enhancing LLMs' ability to access relevant context and act on behalf of users. You can create agents in minutes by selecting a base LLM, agent type, and actions. Simply provide your API, choose AI agent control parameters, and simulate AI agent usage of your API with natural language responses. You can also converse with your agents and share your actions and agents on the Hub, helping to create a vibrant network of actions and agents.
Development & Tools
47.5K

Llmonitor
LLMonitor is a platform that provides observability, analysis, and testing for LLM (Large Language Model) applications. It can record LLM call logs, metrics, and traces, support conversational evaluations and chat record playback, helping optimize the performance and cost control of AI applications. LLMonitor offers features like log monitoring, performance analysis, error tracking, user conversation recording, and user feedback collection. It is suitable for various AI development scenarios, including agents and chatbots.
Development & Tools
59.9K

Chatdev IDE: Building Your AI Agent
ChatDev IDE is a chat development environment that seamlessly connects different agents in various web browsers. It includes game mode, chat mode, and Prompt IDE. You can personalize these NPCs, customize location prompts, and build your GPTs with a visual prompt editor. Supports importing models from the GPTs community or defining your own models. JavaScript support accelerates the prompt engineering process. In addition to ChatGpt, it supports over 10 open-source models like Bing Chat, Google Bard, Claude, QianWen, and iFlytek Spark. You can freely download and install the ChatDev IDE plugin.
AI development assistant
74.8K
- 1
- 2
Featured AI Tools

Flow AI
Flow is an AI-driven movie-making tool designed for creators, utilizing Google DeepMind's advanced models to allow users to easily create excellent movie clips, scenes, and stories. The tool provides a seamless creative experience, supporting user-defined assets or generating content within Flow. In terms of pricing, the Google AI Pro and Google AI Ultra plans offer different functionalities suitable for various user needs.
Video Production
42.0K

Nocode
NoCode is a platform that requires no programming experience, allowing users to quickly generate applications by describing their ideas in natural language, aiming to lower development barriers so more people can realize their ideas. The platform provides real-time previews and one-click deployment features, making it very suitable for non-technical users to turn their ideas into reality.
Development Platform
44.4K

Listenhub
ListenHub is a lightweight AI podcast generation tool that supports both Chinese and English. Based on cutting-edge AI technology, it can quickly generate podcast content of interest to users. Its main advantages include natural dialogue and ultra-realistic voice effects, allowing users to enjoy high-quality auditory experiences anytime and anywhere. ListenHub not only improves the speed of content generation but also offers compatibility with mobile devices, making it convenient for users to use in different settings. The product is positioned as an efficient information acquisition tool, suitable for the needs of a wide range of listeners.
AI
41.7K

Minimax Agent
MiniMax Agent is an intelligent AI companion that adopts the latest multimodal technology. The MCP multi-agent collaboration enables AI teams to efficiently solve complex problems. It provides features such as instant answers, visual analysis, and voice interaction, which can increase productivity by 10 times.
Multimodal technology
42.8K
Chinese Picks

Tencent Hunyuan Image 2.0
Tencent Hunyuan Image 2.0 is Tencent's latest released AI image generation model, significantly improving generation speed and image quality. With a super-high compression ratio codec and new diffusion architecture, image generation speed can reach milliseconds, avoiding the waiting time of traditional generation. At the same time, the model improves the realism and detail representation of images through the combination of reinforcement learning algorithms and human aesthetic knowledge, suitable for professional users such as designers and creators.
Image Generation
41.4K

Openmemory MCP
OpenMemory is an open-source personal memory layer that provides private, portable memory management for large language models (LLMs). It ensures users have full control over their data, maintaining its security when building AI applications. This project supports Docker, Python, and Node.js, making it suitable for developers seeking personalized AI experiences. OpenMemory is particularly suited for users who wish to use AI without revealing personal information.
open source
42.0K

Fastvlm
FastVLM is an efficient visual encoding model designed specifically for visual language models. It uses the innovative FastViTHD hybrid visual encoder to reduce the time required for encoding high-resolution images and the number of output tokens, resulting in excellent performance in both speed and accuracy. FastVLM is primarily positioned to provide developers with powerful visual language processing capabilities, applicable to various scenarios, particularly performing excellently on mobile devices that require rapid response.
Image Processing
41.1K
Chinese Picks

Liblibai
LiblibAI is a leading Chinese AI creative platform offering powerful AI creative tools to help creators bring their imagination to life. The platform provides a vast library of free AI creative models, allowing users to search and utilize these models for image, text, and audio creations. Users can also train their own AI models on the platform. Focused on the diverse needs of creators, LiblibAI is committed to creating inclusive conditions and serving the creative industry, ensuring that everyone can enjoy the joy of creation.
AI Model
6.9M